Extended Version of Parallel Processing Conf. 1992 Paper and Revised Version of Stony Brook Tr # 92/01 Eager Sharing for Eecient Massive Parallelism

نویسندگان

  • Larry D. Wittie
  • Gudjon Hermannsson
چکیده

{ Workstation networks can become tera-FLOPS supercomputers by adding highspeed interfaces supporting selective eager sharing. For Gaussian elimination and fast Fourier transform, selective eager sharing is much more eecient than global sharing of all data changes, and average eeciency remains above 60% for thousands of processors. Prototype SESAME interfaces will share data at 50 megabytes/second among more than 100 workstations. Propagation delays are typically 0:8 microseconds and overlap computations. All shared data reads are quick local accesses. Eager sharing supports diiuse non-local accesses in ne-grained parallel programs much more eeciently than demand driven cache protocols. Future massively parallel supercomputers should ooer eagersharing coherence mechanisms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook Stony Brook the N=2(4) String Is Self-dual N=4 Yang-mills

N=2 string amplitudes, when required to have the Lorentz covariance of the equivalent N=4 string, describe a self-dual form of N=4 super Yang-Mills in 2+2 dimensions. Spin-independent couplings and the ghost nature of SO(2,2) spacetime make it a topological-like theory with vanishing loop corrections.

متن کامل

EXECUTING NESTED PARALLEL LOOPS ON SHARED - MEMORYMULTIPROCESSORSSadun

Cache-coherent, bus-based shared-memory multiprocessors are a cost-eeective platform for parallel processing. In scientiic parallel applications, most of the computation involves processing of large multidimensional data structures which results in a high degree of data parallelism. This parallelism can be exploited in the form of nested parallel loops. Most existing shared memory multiprocesso...

متن کامل

Synchronous Parallel Discrete Event Simulation on Shared-Memory Multiprocessors

This paper describes the implementation and studies the performance of a synchronous , parallel discrete event simulation (SPDES) method on a shared memory multiprocessor. The presented method aims at the eecient simulation of architectural designs for which the asynchronous PDES methods seem to be less eeective. A multiprocessor machine is simulated, and the performance achieved is compared to...

متن کامل

Measuring Hospital Performance Using Mortality Rates: An Alternative to the RAMR

Background The risk-adjusted mortality rate (RAMR) is used widely by healthcare agencies to evaluate hospital performance. The RAMR is insensitive to case volume and requires a confidence interval for proper interpretation, which results in a hypothesis testing framework. Unfamiliarity with hypothesis testing can lead to erroneous interpretations by the public and other stakeholders. We argue t...

متن کامل

STATE UNIVERSITY OF NEW YORK AT STONY BROOK CEAS TECHNICAL REPORG Optimal Load Sharing for a Divisible Job on a Bus Network

Optimal load allocation for load sharing a divisible job over N processors interconnected in bus-oriented network is considered. The processors are equipped with front-end processox:s. It is analytically proved, for the first time, that a minimal solution time is achieved when the computation by each processor finishes at the same time. Closed form solutions for the minimum finish time and the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1992